.  *'#  «  f.r«.l  tf.l'l!.*  <41  fat 


M  lal  f4  « J  M  MM.  «.< 


LABORATORY  FOR 
COMPUTER  SCIENCE 


*_*  «il  »■!  «.|  i.ljjft  V*  l.»U  Ht.«alL»A.*«L*«).»tl 


MASSACHUSETTS 
INSTITUTE  OF 
TECHNOLOGY 


MIT/LCS/TM-356  *HK  FILE  COPY 


PARALLEL  SYMMETRY-BREAKING 
IN  SPARSE  GRAPHS 


Andrew  V.  Goldberg 
Serge  A.  Plotkin 
Gregory  E.  Shannon 


dtic 

.  _OTE| 
'  AUS1  619881 

I  <*_ 


May  1988 


545  TECHNOLOGY  SQUARE,  CAMBRIDGE,  MASSACHUSETTS  02139 

Til»  daetuo«mt  ka»  I 

tot  pnhlk*  nlMM  n4  gg  I 
dtaWbotloa  ta  nHnrfnd.  ’-jppf 

-  -Q  C  C 


* 


1$, 

iw 


m 


<!*.»!»  I.M,'  ,. 


tft.  >r.  A  .:n 


fW  »i| 


REPORT  DOCUMENTATION  PAGE 

la.  REPORT  SECURITY  CLASSIFICATION 

Unclassified 

lb  RESTRICTIVE  MARKINGS 

2a.  SECURITY  CLASSIFICATION  AUTHORITY 

3.  DISTRIBUTION /AVAILABILITY  OF  REPORT 

2b.  DECLASSIFICATION /DOWNGRADING  SCHEDULE 

Approved  for  public  release;  distribution 
is  unlimited. 

4.  PERFORMING  ORGANIZATION  REPORT  NUMBER(S) 

MIT/LCS/TM-356 

S.  MONITORING  ORGANIZATION  REPORT  NUMBER(S) 

N00014-80-C-0622 ,  N00014-86-K-0689 

6a.  NAME  OF  PERFORMING  ORGANIZATION 

MIT  Laboratory  for  Computer 
Science 

6b.  OFFICE  SYMBOL 
(If  applicable) 

7a.  NAME  OF  MONITORING  ORGANIZATION 

Office  of  Naval  Research/Department  of  Navy 

|  6c  ADDRESS  (City.  State,  and  ZIP  Code) 

545  Technology  Square 

Cambridge,  MA  02139 

|  7b.  ADDRESS  (Oty,  State,  and  ZIP  Code) 

Information  Systems  Program 

Arlington,  VA  22217 

8«.  NAME  OF  FUNDING  /SPONSORING  8b.  OFFICE  SYMBOL 

ORGANIZATION  (tf  applicable) 

DARPA/DOD 


8c  AOORESS  (Crty,  State,  and  ZIP  Code) 
1400  Wilson  Blvd. 
Arlington,  VA  22217 


1 1 .  TITLE  (Indude  Security  Classification) 

Parallel  Symmetry-Breaking  in  Sparse  Graphs 


9.  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 


10.  SOURCE  OF  FUNDING  NUMBERS 


PROGRAM 
ELEMENT  NO. 


PROJECT 

NO. 


12.  PERSONAL  AUTHORS) 

Goldberg.  Andrew  V. .  Plotkin.  Serge  A 


13a.  TYPE  OF  REPORT  13b.  TIME  COVERED 

Technical  from _ to 


16.  SUPPLEMENTARY  NOTATION  __  ,  > 


and  Shannon,  Gregory  E, 


14.  DATE  OF  REPORT  [Year.  Month,  Oay)  IlS.  PAGE  COUNT 

1988  May  I  23 


17.  COSATI  CODES 


FIELD  GROUP  SUB-GROUP 


18.  SUBJECT  TERMS  [Continue  on  reverse  if  necessary  and  identify  by  block  number) 

Symmetry-breaking,  parallel  algorithms,  graph  coloring, 
maximal  independent  set,  planar  graphs  .  -  ,  . 


19.  ABSTRACT  (Continue  on  reverse  if  necessary  and  identify  by  block  number)  ;  j 

V 

We  describe  efficient  deterministic  techniques  for  breakirig  symmetry  in  parallel,  These 
techniques  work  well  on  rooted  treees  and  graphs  of  constant  degree  or  genus.  ©tlr 
primary  technique  allows  us  to  3-color  a  rooted  tree  in  0(lg*n)  time  on  an  EREW  PRAM 
using  a  linear  number  of  processors.  We  use  these  techniques  to  construct  fast  linear 
processor  algorithms  for  several  problems,  including  the  problem  of  (&+  l)-coloring 
constant-degree  graphs  and  5-coloring  planar  graphs.  We  also  prove  lower  bounds  for  2- 
coloring  directed  lists  and  for  finding  maximal  independent  sets  in  arbitrary  graphs. 


20.  DISTRIBUTION /AVAILABILITY  OF  ABSTRACT  |21.  ABSTRACT  SECURITY  CLASSIFICATION 

Q UNCLASSIFIEO/UNLIMITED  □  SAME  AS  RPT.  □  DTIC  USERS  1  Unclassified _ 


22a.  NAME  OF  RESPONSIBLE  INDIVIDUAL  22b.  TELEPHONE  (include  Area  Code)  |  22c.  OFFICE  SYMBOL 

Judy  Little,  Publications  Coordinator  (617)  253-5894  I 


DO  FORM  1473, 84  MAR  83  APR  edition  may  be  used  until  exhausted.  SECURITY  CLASSIFICATION  OF  THIS  PAGE 

All  other  editions  are  obsolete  "  "  " 


Unclassified 


Parallel  Symmetry-Breaking  in  Sparse  Graphs* 


Andrew  V.  Goldberg* 
Dept,  of  Computer  Science 
Stanford  University 
Stanford,  CA  94305 


Serge  A.  Plotkin* 

Lab.  for  Computer  Science 
M.I.T. 

Cambridge,  MA  02139 


Gregory  E.  Shannon5 
Computer  Sciences  Dept. 

Purdue  University 
West  Lafayette,  IN  47907 

OTIC  TAB 
Unanaonaoxl 
JUfttiflMtlML. 


□ 


Distribution/ 
Availability  Codes 
[Avaft.  and/or 
|Dlst  |  Speolal 


Abstract 


We  describe  efficient  deterministic  techniques  for  breaking  symmetry  in  parallel. 
These  techniques  work  well  on  rooted  trees  and  graphs  of  constant  degree  or  genus. 
Our  primary  technique  allows  us  to  3-color  a  rooted  tree  in  0(lg*n )  time  on  an  EREW 
PRAM  using  a  linear  number  of  processors.  We  use  these  techniques  to  construct  fast 
linear  processor  algorithms  for  several  problems,  including  the  problem  of  (A  +  1)- 
coloring  constant-degree  graphs  and  5-coloring  planar  graphs.  We  also  prove  lower 
bounds  for  2-coloring  directed  lists  and  for  finding  maximal  independent  sets  in  arbi¬ 
trary  graphs. 

Keywords:  :  Symmetry-breaking,  parallel  algorithms,  graph  coloring,  maximal  inde¬ 
pendent  set,  planar  graphs. 


_ _ ^ 

njfi 

I  h; 

1 

*A  preliminary  version  of  this  paper  appeared  in  the  Proc.  of  the  19<A  ACM  Symp.  on  Theory  of 
Computing,  May  1987. 

tPart  of  this  research  was  done  while  the  author  was  with  at  Lab.  for  Computer  Science,  MIT,  Cambridge. 
Supported  in  part  by  a  Fannie  and  John  Hertz  Foundation  Fellowship  and  DARPA  Contract  N00014-80-C- 
0622. 

^Supported  by  DARPA  Contract  N00014-80-C-0622.  Part  of  the  work  was  done  while  the  author  was  at 
AT&T  Bel!  Laboratories,  Murray  Hill,  NJ  07974. 

^Supported  in  part  by  Hewlett-Packard’s  Faculty  Development  Program,  NSF  Grant  DCR-8320124,  and 
ONR  Contract  N000 14-86- K-0689. 


1  Introduction 


Some  problems  for  which  trivial  sequential  algorithms  exist  appear  to  be  much  harder  to 
solve  in  a  parallel  framework.  When  converting  a  sequential  algorithm  to  a  parallel  one, 
at  each  step  of  the  parallel  algorithm  we  have  to  choose  a  set  of  operations  which  may 
be  executed  in  parallel.  Often,  we  have  to  choose  these  operations  from  a  large  set  of 
symmetrical  operations,  where  interdependencies  prevent  simultaneous  execution  of  all  the 
operations  in  the  set.  Symmetry-breaking  techniques  enable  the  algorithm  to  select  a  large 

subset  of  independent  operations. 

-1  i?TC* 

'«■  PfrriKttgda  maximal  independent  set  (MIS)  of  a  graph  is  a  good  example  of  the  necessity 
Qi  -Symmetry-breaking-  At  any  step,  a  parallel  MIS  algorithm  might  have  many  candidate 
nodes  to  add  to  the  independent  set.  However,  due  to  adjacency  constraints,  not  all  of  these 

’  nodes  can  be  added  simultaneously.  A  symmetry-breaking  technique  is  therefore  needed  to 

n  -  j 

fill'd  a  large  set,  of  nodes  to  add,  as  has  been  done  in  previous  parallel  MIS  algorithms  [GS87, 
KW85,  Lub86j. 

Previous  symmetry-breaking  techniques  have  focused  on  randomization.  It  is  often  de¬ 
sirable,  however,  to  have  a  deterministic  algorithm.  Karp  and  Wigderson  [KW85],  and  Luby 
[Lub86]  proposed  methods  to  convert  certain  randomized  algorithms  into  deterministic  ones. 
Their  methods,  however,  significantly  increase  the  number  of  processors  used. 

In  many  cases  it  is  sufficient  to  break  symmetry  in  sparse  graphs.  In  this  paper  we 
introduce  deterministic  symmetry-breaking  techniques  for  sparse  graphs  that  use  a  linear 
number  of  processors.  Our  primary  technique  allows  us  to  3-color  a  rooted  tree  in  0(lg*n) 
time  on  a  CREW  PRAM.  This  technique  was  motivated  by  the  deterministic  coin-flipping 
technique  developed  by  Cole  and  Vishkin  [CV86]. 

We  use  our  techniques  to  develop  the  linear-processor  algorithms  listed  below. 

•  For  graphs  whose  maximum  degree  is  A,  we  give  an  £?((lg  A)(A2  +  lg*n))-time  EREW 
PRAM  algorithm  for  (A+ 1  )-coloring  and  for  finding  a  maximal  independent  set. 

•  For  planar  graphs,  we  give  7-coloring,  MlS,  and  maximal  matching  algorithms  that 
run  in  O(lgn)  time  on  a  CRCW  PRAM  and  in  0(lg2n)  time  on  an  EREW  PRAM. 


We  give  an  0(lgnlg*n)-time  CRCW  PRAM  algorithm  for  5-coloring  an  embedded 
planar  graph. 


The  above  results  improve  the  running  time  and  processor  bounds  for  the  respective 
problems.  The  best  deterministic  linear-processor  algorithm  for  finding  MIS  [GS87]  runs  in 
0(lg4  n)  time  on  constant-degree  graphs,  compared  to  0(lg*n)  time  of  our  algorithm.  The 
5-coloring  algorithms  for  planar  graphs  described  in  [BK87,  Nao86]  use  0(lg3n)  time  and 
the  same  (large)  number  of  processors  as  needed  by  Luby’s  MIS  subroutine  [Lub86].  The 
0(lg3n)  running  time  of  the  maximal  matching  algorithm  due  to  Israeli  and  Shiloach  [IS86] 
can  be  reduced  to  0(lg2  n)  in  the  restricted  case  of  planar  graphs,  but  our  algorithm  is  faster. 

Although  in  this  paper  we  have  limited  ourselves  to  the  application  of  our  techniques  for 
the  design  of  parallel  algorithms  for  the  PRAM  model  of  computation,  the  same  techniques 
can  be  applied  in  a  distributed  model  of  computation  [Awe85,  GHS83].  Moreover,  the 
Q(lg*n)  lower  bound  for  the  MIS  problem  on  a  chain  in  the  distributed  model  implies  that 
our  symmetry-breaking  technique  is  optimal  in  this  model  [Awe87,  Lin87]. 

Since  we  can  3-color  a  rooted  tree  in  0(lg*n)  time,  it  is  natural  to  ask  if  a  rooted  tree  can 
be  2-colored  as  quickly.  We  answer  this  question  by  giving  an  f7(lgn/lglgn)  lower  bound 
for  2-coloring  of  a  rooted  tree.  We  also  present  an  f!(lgn/lglgn)  lower  bound  for  finding 
a  maximal  independent  set  in  a  general  graph,  thus  answering  the  question  posed  by  Luby 
[Lub86]. 

This  paper  is  organized  as  follows.  In  Section  2  we  present  definitions,  notation,  and 
computation  model  details.  In  Section  3  we  present  the  algorithm  for  3-coloring  rooted 
trees.  In  Section  4  we  use  this  algorithm  to  ( A  +  l)-color  constant-degree  graphs.  In  Section 
5  we  use  results  of  Section  4  to  develop  algorithms  for  planar  graphs.  In  Section  6  we  prove 
the  lower  bounds  mentioned  earlier. 
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2  Preliminaries 

This  section  describes  the  assumptions  about  the  computational  model  and  introduces  the 
notation  used  throughout  the  paper.  We  consider  simple,  undirected  graphs  with  n  vertices 
and  m  edges.  The  maximum  degree  of  a  graph  is  denoted  by  A.  The  graph  induced  by  a 
set  of  nodes  X  is  denoted  by  G{ X]. 

Given  a  graph  G  =  (V,  E ),  we  say  that  a  subset  of  nodes  I  C  V  is  independent  if  no  two 
nodes  in  I  are  adjacent.  A  coloring  of  a  graph  G  is  an  assignment  C  :  V  — >  I+  U  {0}  of 
nonnegative  (not  necesseraly  consequitive)  integers  (colors)  to  nodes  of  the  graph.  A  coloring 
is  valid  if  no  two  adjacent  nodes  have  the  same  color.  The  bits  are  numbered  from  0,  and  the 
*th  bit  in  the  color  of  a  node  v  is  denoted  by  Cv(i).  A  subset  of  edges  M  C  E  is  a  matching 
if  each  pair  of  distinct  edges  in  M  have  no  nodes  in  common. 

The  following  problems  are  discussed  in  the  paper: 

•  The  node-coloring  problem:  find  a  valid  coloring  of  a  given  graph  that  uses  at  most 
A-f  1  colors. 

•  The  maximal  independent  set  (MIS)  problem:  find  a  maximal  independent  set  of 
vertices  in  a  given  graph. 

•  The  maximal  matching  (MM)  problem:  find  a  maximal  matching  in  a  given  graph. 

We  make  a  distinction  between  unrooted  and  rooted  trees.  In  a  rooted  tree,  each  nonroot 
node  knows  which  of  its  neighbors  is  its  parent. 

The  following  notation  is  used: 

lg  X  =  log2  X 

lg(1)x  =  lg  X 

Igh)  x  =  lglgh-1)  x 

lg*x  =  min{i|  lg(,)  x  <  2} 

We  assume  a  PRAM  model  of  computation  [B1I85,  FW78]  where  each  processor  is  capable 
of  executing  simple  word  and  bit  operations.  The  word  width  is  assumed  to  be  O(lgn).  The 
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Procedure  6- Color- Rooted- Tree(T) 

Nc  4  'O'i 

for  all  v  £  V  in  parallel  do  Cv  *—  PE-ID(t>); 
while  Nc  >  6  do 

for  all  v  £  V  in  parallel  do  begin 
if  v  is  the  root  then  begin 
iv  *—  0; 
bv  *—  Ct,(0); 
end; 

else  begin 

iv  <  min{i  |  C„(f)  ^  Cparent(ti)(0}i 
by  Cy(iy); 

end; 

Cy  *  iyby  J 

end; 

Nc  <-  max{C„  |  v  €  V}  +  1; 

end; 

end. 


Figure  1:  The  Coloring  Algorithm  for  Rooted  Trees 

word  operations  we  use  include  bit-wise  boolean  operations,  integer  comparisons,  and  unary- 
to-binary  conversion.  Each  processor  P  has  a  unique  identification  number  0(lg  n)  bits  wide, 
which  we  denote  by  PE-ID(P).  We  use  adjacency  lists  to  represent  the  graph,  assigning 
a  processor  to  each  edge  and  each  node  of  the  graph.  We  use  exclusive-read  exclusive- 
write  (EREW)  PRAM,  concurrent-read  exclusive-write  (CREW)  PRAM,  and  concurrent- 
read  concurrent-write  (CRCW)  PRAM,  as  appropriate.  The  write  conflicts  in  CRCW  PRAM 
are  assumed  to  be  resolved  arbitrarily.  All  lower  bounds  are  proven  for  a  CRCW  PRAM 
with  a  polynomial  number  of  processors. 


3  Coloring  Rooted  Trees 

This  section  describes  an  0(lg*n)-time  algorithm  for  3-coloring  rooted  trees.  We  first  describe 
an  0(lg*«)-time  algorithm  for  (i-coloring  rooted  trees  and  then  show  how  to  transform  a  6- 
coloring  °f  3-  rooted  tree  into  a  3-coloring  in  constant  time. 
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The  procedure  6-Color-Rooted-Trecis  shown  in  Figure  1.  This  procedure  accepts  a  rooted 
tree  T  =  ( V ,  E)  and  6-colors  it  in  time  0(lg*n).  Starting  from  the  valid  coloring  given  by  the 
processor  ID’s,  the  procedure  iteratively  reduces  the  number  of  bits  in  the  color  descriptions 
by  recoloring  each  nonroot  node  v  with  the  color  obtained  by  concatenating  the  index  of  a 
bit  in  which  Cv  differs  from  CpaTent(v)  and  the  value  of  this  bit.  The  root  r  concatenates  0 
and  Cr[0]  to  form  its  new  color. 

Theorem  1  The  algorithm  6-Color-Rooted-Tree  produces  a  valid  6-coloring  of  a  tree  in 
0(lg*n)  time  on  a  CREW  PRAM  using  a  linear  number  of  processors. 

Proof:  First  we  prove  by  induction  that  the  coloring  computed  by  the  algorithm  is  valid, 
and  then  we  prove  the  upper  bound  on  the  execution  time. 

Assuming  that  the  coloring  C  is  valid  at  the  beginning  of  an  iteration,  show  that  the 
coloring  at  the  end  of  the  iteration  is  also  valid.  Let  v  and  w  be  two  adjacent  nodes  with 
v  being  the  parent  of  w.  In  the  algorithm,  w  chooses  some  index  i  such  that  Cv(i)  /  Cw(i) 
and  v  chooses  some  index  j  such  that  Cv(j)  ±  Cparent(v)(j).  The  new  color  of  w  is  (i,Cw{i)) 
and  the  new  color  of  v  is  (j,  Cv(j)).  If  i  yf  j,  the  new  colors  are  different  and  we  are  done. 
On  the  other  hand,  if  i  =  j,  then  Cv{i)  can  not  be  equal  to  Cw(i)  by  the  definition  of  i,  and 
again  the  colors  are  different.  Hence,  the  validity  of  the  coloring  is  preserved. 

Now  we  show  that  the  algorithm  terminates  after  0(lg*n)  iterations.  Let  Lk  denote  the 
number  of  bits  in  the  representation  of  colors  after  k  iterations.  For  k  =  1  we  have 

Li  =  fig  I]  +  1 
<  2  fig  L] 

if  fig  L]  >  1. 

Assume  for  some  k  we  have  Lk_x  <  2[lg(fc  l)  L]  and  flg(fc)  L]  >  2.  Then 

Lk  —  fig  Lk-i]  -f  1 

<  rig^ig^zoi  +  i 

<  2flg(*>//| 


Therefore,  as  long  as  flg^  L]  >  2, 


Lk  <  ‘2flg(fc)  /,]. 


Hence,  the  number  of  bits  in  the  representation  of  colors  Lk  decreases  until,  after  0(lg*n) 
iterations,  flg^  L]  becomes  1  and  Lk  reaches  the  value  of  3  (the  solution  of  L  =  fig  L~\  +  1). 
Another  iteration  of  the  algorithm  produces  a  6-coloring:  3  possible  values  of  the  index  iv 
and  2  possible  values  of  the  bit  bv.  The  algorithm  terminates  at  this  point. 

Using  concurrent-read,  each  node  determines  its  parent’s  color  in  constant  time.  Given 
two  colors,  Cv  and  Cw,  we  can  compute  the  smallest  index  j  such  that  the  }-th  bit  of  Cv  differs 
from  the  j-th  bit  of  Cw  by  computing  j  =  unary-to-binary(|Cv  —  Cw |  XOR  (| Cv  —  Cw  \  —  1)). 
Hence,  each  node  can  compute  the  new  color  independently  in  constant  time.  Therefore, 
each  iteration  takes  constant  time  and  the  algorithm  uses  0(lg*n)  time  overall.  Note  that 
no  concurrent-write  capabilities  are  required;  for  constant- degree  trees  the  concurrent-read 
capability  is  not  needed  either.  | 

We  now  describe  the  algorithm  3-Color-Rooted-Tree  which  3-colors  a  rooted  tree.  The 
algorithm  first  applies  6-Color-Rooted-Tree  to  produce  a  valid  6-coloring  of  the  tree.  Then 
it  executes  three  stages,  each  time  reducing  the  number  of  colors  by  one. 

Each  stage  works  as  follows.  By  shifting  down  the  coloring  we  mean  recoloring  each 
nonroot  node  with  the  color  of  its  parent  and  recoloring  the  root  with  a  color  different  from 
its  current  color.  To  remove  the  color  c  €  {3,4,5},  first  shift  down  the  current  coloring. 
Then,  recolor  each  node  of  color  c  with  the  smallest  color  different  from  its  parent’s  and 
children’s  colors. 

Theorem  2  Given  a  rooted  tree  T ,  the  algorithm  3-Color-Rooted-Tree  constructs  a  valid 
3-coloring  of  T  using  n  processors  and  0(lg*n)  time  on  a  CREW  PRAM. 

Proof:  After  a  shift  of  colors,  the  children  of  any  node  have  the  same  color.  Thus  each  node 
is  adjacent  to  nodes  of  at  most  two  different  colors.  Therefore,  each  stage  of  the  algorithm 
reduces  the  number  of  colors  by  one,  as  long  as  the  number  of  colors  is  greater  than  three. 
Each  stage  takes  a  constant  time  on  a  CREW  PRAM.  The  theorem  follows  from  Theorem 

1-  I 

To  describe  the  subsequent  algorithms,  we  introduce  the  concept  of  a  pseudoforest  [PQS2]. 
A  pscudoforest  ol  G  —  (V,  E)  is  a  directed  graph  G'  =  (V,  E'),  such  that.  (»/,»>)  <E  E'  => 


'“T" 


{u,  u}  G  E  and  outdegree  of  any  node  is  at  most  one.  A  maximal  pseudoforest  of  G  —  ( V. ,  E) 
is  a  directed  graph  G'  =  (V,E'),  such  that  (u,u)  G  E'  {u,u}  G  E  and  outdegree  of  any 
node  in  G'  is  one,  unless  this  node  is  zero-degree  in  G.  Nodes  with  zero  out-degree  are 
roots  of  the  pseudoforest.  We  assume  that  graphs  are  represented  by  adjacency  lists,  and 
therefore  a  maximal  pseudoforest  can  be  constructed  in  (parallel)  constant  time  by  choosing 
an  arbitrary  adjacent  edge  for  every  node  and  directing  this  edge  outward. 


The  coloring  algorithms  presented  in  this  section  work  for  pseudoforests  as  well  as  for 
rooted  trees.  Therefore,  a  pseudoforest  can  be  3-colored  in  0(lg*/r)  time  on  an  CRCW  PRAM 
using  a  linear  number  of  processors.  We  shall  call  the  procedure  for  3-coloring  pseudoforests 
3-Color-Pseudoforest.  Note  that  an  odd  cycle  is  a  pseudoforest  that  can  not  be  colored  in  less 
than  3  colors,  and  therefore  the  number  of  colors  used  by  the  procedure  3-Color-Pseudoforest 
is  optimal  in  this  case. 


Any  tree  can  be  2-colored.  In  fact,  it  is  easy  to  2-color  a  tree  in  polylogarithmic  time. 
For  example,  one  can  use  treefix  operations  [LM86,  MR85]  to  compute  the  distance  from 
each  node  to  the  root,  and  color  even  level  nodes  with  one  color  and  odd  level  nodes  with 
the  other  color.  It  is  harder  to  find  a  2-coloring  of  a  rooted  tree  in  parallel,  however,  than  it 
is  to  find  a  3-coloring  of  a  rooted  tree.  In  section  6  we  show  a  lower  bound  of  fl(lgn/lglgn) 
on  2-coloring  of  a  directed  list  on  a  CRCW  PRAM  with  a  polynomial  number  of  processors, 
which  implies  the  same  lower  bound  for  rooted  trees. 


t 

L 


i." 


f. 

/ 

\ 


4  Coloring  Constant-Degree  Graphs 

The  method  for  coloring  rooted  trees,  described  in  the  previous  section,  is  a  generaliza¬ 
tion  of  the  deterministic  coin-flipping  technique  described  in  [CVS6] .  The  method  can  be 
generalized  even  further  [GP87b]  to  color  constant-degree  graphs  in  a  constant  number  of 
colors.  In  the  generalized  algorithm,  a  current  color  of  a  node  is  replaced  by  a  new  color 
obtained  by  looking  at  each  neighbor,  appending  the  index  of  a  bit  in  which  the  current 
color  of  the  node  is  different  from  the  neighbors’  color  to  the  value  of  the  bit  in  the  node 
color,  and  concatenating  the  resulting  strings.  This  algorithm  runs  in  0(lg*?i)  time,  but  the 
number  of  colors,  although  constant  as  a  function  of  n,  is  exponential  in  the  degree  of  the  ► 


P roced ure  Color- Constant- Degree- G raph. 

E'  <-  {( v,w )  |  {v,u'}  €  E}; 

for  t  =  0  to  A  do  begin  ((  the  first  phase)) 

for  all  v  £  V  in  parallel  do 

if  3(u,  u)  e  E'  then  E{  <—  E{  +  ( v ,  u)  ; 

E'  *—  E'  —  E{\  {(  Ei  are  edges  of  a  maximal  pseudoforest  )) 

end; 

for  all  v  €  V  in  parallel  do  ((  initial  coloring)) 

C(v)  <—  0; 

for  all  0  <  i  <  A  in  parallel  do  {(  color  the  pseudoforests)) 

C{  <—  3-Color-Pseudoforest(V,  Ei); 

for  i  <—  A  down  to  0  do  begin  ((  the  second  phase)) 

E'  «-  E'  +  Ei ; 

for  k  <—  1  to  2,  j  *—  0  to  A  do 
V'  V; 

for  all  v  £  V'  in  parallel  do 
if  C(v)  =  j  and  Ci{v)  —  k 

then  begin 

C{v)  <-  niax{{0, 1, . . A}-  {C{w)  |  (o,w)€  E'}}; 

V'  +-  V'  -  {«}; 
end; 

end; 

end; 

end; 

end. 


Figure  2:  The  Coloring  Algorithm  for  Constant  Degree  Graphs 


graph. 

In  this  section  we  show  how  to  use  the  procedure  3-Color-Pseudoforest ,  described  in  the 
previous  section,  to  color  a  constant-degree  graph  with  (A-f  1)  colors. 

The  algorithm  Color-Const  ant- Degree-Graph  which  colors  a  constant-degree  graph  G  = 
( V ,  E)  with  (A+l)  colors  is  presented  in  Figure  2.  The  algorithm  consists  of  two  phases.  In 
the  first  phase  we  iteratively  construct  a  maximal  pseudoforcst  and  remove  its  edges  from 
G.  This  phase  continues  until  no  edges  remain,  at  which  point  we  color  all  the  nodes  with 
one  color.  Then  we  color  all  the  pseudoforests  with  3  colors  in  parallel. 

In  the  second  phase  we  iteratively  return  the  edges  of  the  current  pseudoforest,  each  time 
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recoloring  the  nodes  to  maintain  a  consistent  coloring.  At  the  beginning  of  each  iteration  of 
this  phase,  the  edges  E'  of  the  current  pseudoforest  are  added,  making  the  existing  (A  +  1)- 
coloring  inconsistent.  The  forest  E'  is  already  colored  with  3  colors.  Now,  each  node  has 
two  colors  -  one  from  the  coloring  at  the  previous  iteration  and  one  from  the  coloring  of  the 
forest.  The  pairs  of  colors  form  a  valid  3(A-f-l)-coloring  of  the  graph.  The  iteration  finishes 
by  enumerating  the  color  classes,  recoloring  each  node  of  the  current  color  with  a  color  from 
{0, . . . ,  A}  that  is  different  from  the  colors  of  its  neighbors.  We  can  recolor  all  the  nodes  of 
the  same  color  in  parallel  because  they  are  independent. 

Theorem  3  The  algorithm  Color- Constant-Degree- Graph  colors  the  graph  with  (A-fl)  col¬ 
ors  and  runs  in  0((lg  A)(A2  +  lg*n))  time  on  an  EREW  PRAM  using  a  linear  number  of 
processors. 

Proof :  At  each  iteration  all  edges  of  the  maximal  pseudoforest  are  removed.  The  definition 
of  a  maximal  pseudoforest  implies  that  each  node  that  still  has  neighbors  in  the  beginning 
of  an  iteration  has  at  least  one  edge  removed  during  that  iteration  and  therefore  its  degree 
decreases.  After  at  most  A  iterations,  E'  is  empty.  The  running  time  of  each  iteration 
is  determined  by  the  time  required  to  select  an  unused  edge  out  of  an  edge  list.  On  an 
EREW  PRAM,  an  unused  edge  can  be  selected  in  0(lg  A)  time.  The  pseudoforests  are 
edge-disjoint  and  therefore  can  be  colored  in  parallel.  By  Theorem  2,  this  takes  0(lg  Alg'n) 
time  on  an  EREW  PRAM.  The  IgA  factor  appears  because  we  do  not  use  the  concurrent- 
read  capability;  a  node  must  broadcast  its  color  to  its  children  using,  for  example,  recursive 
doubling.  The  total  time  bound  for  the  first  stage  is  therefore  0((lg  A)(A  +  lg*n)). 

The  second  phase  terminates  in  at  most  A  iterations  as  well.  For  each  pseudoforest  we 
iterate  over  all  the  colors.  Since  in  this  section  we  assume  that  A  is  a  constant,  each  iteration 
can  be  done  in  0(lg  A)  time  using  word  operations;  for  example,  we  can  represent  colors  as  bit 
vectors  and  use  exclusive-or  funct'on  together  with  the  recursive  doubling  technique.  Hence, 
one  iteration  of  the  second  phase  takes  0(Alg  A)  time,  leading  to  an  overall  0((lg  A)(A2  + 
lg*n))  running  time  for  the  second  stage  of  the  algorithm  and  for  the  algorithm  itself.  | 

Given  a  (A  +  l)-coloring  of  a  graph,  we  can  find  an  MIS  of  the  graph  by  iterating 
over  the  colors,  taking  all  the  remaining  nodes  of  the  current  color,  adding  them  to  the 


independent  set,  and  removing  them  and  all  their  neighbors  from  the  graph.  (VVe  refer  to 
this  procedure  as  Constant-Degree- MIS  in  the  subsequent  sections.)  The  running  time  of  this 
algorithm  is  dominated  by  the  running  time  of  the  Color-Constant- Degree- Graph  procedure. 
The  following  theorem  states  this  fact  formally. 
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Theorem  4  An  MIS  in  constant- degree  A  graphs  can  be  found  in  0((lg  A)(A2  +  lg”n))  time 
on  an  EREW  PRAM  using  a  linear  number  of  processors. 

Remask:  The  proofs  of  Theorems  3  and  4  imply  that  the  algorithms  Color- Constant-Degree- 
Graph  and  Constant-Degree-MIS  have  a  polylogarithmic  running  times  for  graphs  with  poly- 
logarithmic  maximum  degrees.  For  graphs  with  arbitrary  maximum  degree  we  can  use  the 
following  algorithm.  First,  the  graph  is  partitioned  into  two  subgraphs  with  approximately 
equal  number  of  nodes,  and  the  subgraphs  are  recursively  colored  in  A-f  1  colors.  Then  we 
iterate  through  all  the  colors  of  one  of  the  subgraphs,  recoloring  each  node  with  a  color  dif¬ 
ferent  from  the  colors  of  all  of  its  neighbors.  We  can  find  this  color  using  sorting  in  0(lg  A) 
time  [Col86].  This  algorithm  colors  a  graph  with  a  maximum  degree  of  A  with  A  +  l  colors 
in  O(AlgAlgn)  time. 

The  above  algorithms  can  be  implemented  in  the  distributed  model  of  computation 
[Awe85,  GHS83],  where  processors  have  fixed  connections  determined  by  the  input  graph. 
The  algorithms  in  the  distributed  model  achieve  the  same  0(lg*n)  bound  as  in  the  EREW 
PRAM  model.  It  was  recently  shown  that  f2(lg"‘n)  time  is  required  in  the  distributed  model 
to  find  a  maximal  independent  set  on  a  chain  [Awe87,  Lin87|.  Our  algorithms  are  therefore 
optimal  (to  within  a  constant  factor)  in  the  distributed  model. 

In  [Sha86],  flat  forests  are  used  to  develop  a  linear  processor  constant-degree  MIS  algo¬ 
rithm  which  used  time  exponential  in  A.  A  forest  is  flat  if  each  of  its  trees,  when  properly 
oriented,  has  a  height  of  at  most  1,  and  any  zero-degree  node  in  the  forest  is  zero-degree  in 
the  input  graph.  Using  the  techniques  introduced  in  this  section,  we  can  find  a  flat  forest  of  a 
graph  by  proceeding  as  follows.  Find  a  maximal  pseudoforest  P  =  G(V,  E1).  Note  that  there 
exists  a  flat  forest  F  =  (V,  E"),  such  that  E"  C  E'.  Use  the  algorithm  3-Color-Pseudoforcst 
to  find  a  3-coloring  of  the  pseudoforest  P  and  subsequently  find  an  MIS  /  of  P .  Each  node 
v  J  adds  an  edge  (u,ri)  to  E"  such  that  u  £  /.  Each  node  in  /  with  no  adjacent  edges 
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in  E ",  but  some  adjacent  edges  in  E',  chooses  one  adjacent  edge  in  E'  and  adds  it  to  E" . 
The  graph  F  induced  by  the  edges  in  E"  is  almost  a  flat  forest  -  each  tree  has  a  height  of  at 
most  2.  Now  we  split  trees  of  height  2  in  F  into  trees  of  height  one  to  produce  a  flat  forest. 
All  operations  take  constant  time  except  the  operation  of  finding  the  3-coloring  of  P,  which 
takes  0(lg*n)  time.  Therefore,  we  can  find  a  flat  forest  in  0(lg*?r)  time  on  a  CREW  PRAM 
using  n  processors. 

5  Coloring  and  Matching  in  Planar  Graphs 

Euler’s  formula  [Har72]  implies  that  every  planar  graph  has  a  constant  fraction  of  nodes 
of  degree  6  or  less.  In  this  section  we  use  this  property  in  conjunction  with  the  techniques 
developed  above  to  construct  efficient  algorithms  for  coloring  and  finding  maximal  matchings 
in  planar  graphs. 

First  we  present  the  algorithm  7-Color-Planar-Graph  which  finds  a  7-coloring  of  a  planar 
graph  in  O(lgn)  time.  The  algorithm  is  shown  in  Figure  3.  The  first,  stage  of  the  algorithm 
partitions  the  nodes  of  the  graph  into  sets  K,  such  that  the  degree  of  any  node  v  e  V, 
in  G{V{  +  K'+i  4  Vi+2,...]  is  at  most  6  (Vi  consists  of  all  nodes  of  degree  at  most  6  in 
G\V  —  (Vo  U  Vj  U  . . .  U  Vi.,)]).  Then,  the  algorithm  colors  all  the  subgraphs  induced  by  the 
node-sets  {R,}.  These  graphs  are  node-disjoint  and  therefore  the  coloring  can  be  done  in 
parallel.  The  last  stage  ot  the  algorithm  adds  the  subgraphs  back  in  reverse  order,  updating 
the  coloring. 

Theorem  5  The  algorithm  7-Color-Planar-Graph  constructs  a  valid  7-coloring  using  n  pro¬ 
cessors  and  O(lgn)  time  on  a  CRCW  PRAM. 

Proof :  By  Euler’s  formula.,  at  least  a  constant  fraction  of  any  planar  graph’s  nodes  are  of 
degree  6  or  less.  Therefore,  the  first  stage  partitions  the  nodes  of  G  into  at  most  O(lgn) 
sets  V{.  We  use  concurrent  reads  and  writes  to  determine  whether  the  degree  of  a  node 
is  at  most  6,  and  hence  each  iteration  of  the  first  stage  is  done  in  constant  time.  By 
Theorem  3,  the  second  stage  uses  only  0(lg*n)  time.  In  the  ilh  iteration  of  the  third  stage, 
the  graph  G[VX]  is  already  7-colored  and  the  maximum  degree  of  each  node  in  V,  in  the  graph 
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Procedure  7-Color- Planar-Graph 

V  +-  V ; 

Vi,V2,...V5pgnl  <—  0; 

i  <—  0; 

while  V'  ±  0  for  all  v  G  V'  do  in  parallel 
if  Degree(u)  <  6 
then  begin 

Vi  -  Vi  +  v; 

F'  *-  V'  -  t;; 

end; 

i  *-  i  +  1; 
end; 

k  <-  i  -  1; 

for  all  0  <  i  <  k  do  in  parallel  ((  c°l°r 

Ei  <-  {{u, w}  |  v,  w  G  Vi  ;  {u,  w }  G  E}\ 

Ci  *-  Color-Constant-Degree-Graph(F„  E{)\ 

end; 

for  i  «—  k  down  to  0  do 
V"  -  V-,; 
for  j  ♦—  0  to  6  do 

for  all  v  G  V"  do  in  parallel 
ifC„=j 
then  begin 

Cv  <—  max{{0, . . . ,  6}  -  { Cw  |  w  G  V'\  {r,  w}  G  E}  }; 
V"  -  V"  -  v- 
V'  -  V'  +  v, 

end; 

end; 

end; 

end; 

end. 


Figure  3:  The  7-Coloring  Algorithm  For  Planar  Graphs 
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G[Vi  4.  vi+i  +  Vi+2  +  . . .]  is  at  most  6.  Only  constant  time  is  then  needed  to  add  in  V,  and 
produce  a  valid  7-coloring  of  G[Vi  +  U;+i  +  Vi+2  +  ...]•  Therefore,  only  O(lgn)  time  is  used 
in  all  three  stages.  | 

Remark:  If  at  the  first  stage,  instead  of  removing  from  the  graph  all  the  nodes  with  degree  of 
at  most  6,  we  remove  all  nodes  with  degree  of  at  most  c  times  average  degree  (for  c  >  1),  the 
algorithm  described  above  runs  in  polylogarithmic  time  for  any  graph  G  such  that  the  average 
degree  of  any  node-induced  subgraph  G'  of  G  is  polylogarithmic  in  the  size  of  G' .  This  class 
contains  many  important  subclasses  including  graphs  that  are  unions  of  a  polylogarithmic 
number  of  planar  graphs  (i.e.  graphs  with  polylogarithmic  thickness  [Har72]). 

Given  a  valid  7-coloring  of  a  planar  graph,  we  can  find  an  MIS  in  the  graph  by  iterating 
through  colors  as  in  our  Constant-Degree-MIS  algorithm.  With  concurrent  reads  and  writes, 
only  constant  time  is  needed  for  each  color  class.  Hence,  we  can  find  an  MIS  in  a  planar 
graph  in  0(lg  n)  time  on  a  CRCW  PRAM  using  a  linear  number  of  processors. 

The  deterministic  parallel  algorithms  for  5-coloring  planar  graphs,  described  in  [BK87] 
and  in  [Nao86],  use  f!(lg 3  n)  time  and  0(n3)  processors.  These  algorithms  require  a  large 
number  of  processors  because  they  use  Luby’s  MIS  algorithm  [Lub86].  Using  the  Constant- 
Degree-MIS  algorithm  described  in  the  previous  section,  we  can  reduce  the  number  of  pro¬ 
cessors  to  linear,  while  increasing  the  running  time  by  a  factor  of  0(lg*n)  [GP87a,  Gol87j. 

The  5-coloring  algorithm  presented  below  is  essentially  a  parallelization  of  the  sequential 
algorithms  in  [CNS81,  MST80],  Given  an  embedding  (which  can  be  computed  in  0(lg2n) 
time  [KR86]),  our  algorithm  runs  in  0(lg?rlg*n)  time  on  a  CRCW  PRAM  using  a  linear 
number  of  processors.  Given  a  graph  G  —  ( V,E ),  the  algorithm  finds  a  special  large  inde¬ 
pendent  set  /  of  nodes  in  G,  merges  some  of  the  neighbors  of  /  (as  described  below)  and 
removes  the  nodes  in  I  to  create  a  new  graph  G',  recursively  colors  G',  and  uses  this  coloring 
to  color  the  nodes  in  G. 

The  special  independent  set  I  is  constructed  as  follows.  Let  Q  be  the  set  of  all  nodes  in 
G  of  degree  greater  than  42.  Let  V4  be  the  set  of  all  nodes  of  degree  4  or  less.  Let  Vs  and  Vg 
be  the  set  of  all  nodes  of  degree  5  with  at  most  one  neighbor  in  Q  and  the  set  of  all  nodes 
of  degree  6  with  no  neighbors  in  Q ,  respectively.  Let  S  =  V4  U  V5  U  V6.  Let  Gs  —  (S,  Es)  be 
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the  graph  induced  by  the  nodes  in  S  in  the  graph  which  is  the  square  of  G[V  —  Q ]  The  set 
I  is  a  maximal  independent  set  in  the  graph  G‘  U  G[V  —  Q]-  Since  G3  and  G[V  —  Q]  are  of 
constant  degree,  we  can  find  I  using  the  procedure  Constant-Degree-MIS. 

In  order  to  construct  the  graph  G' ,  the  algorithm  proceeds  as  follows.  Start  with  G'  =  G. 
For  each  node  in  in  V5  we  find  two  of  its  non-adjacent  neighbors  that  have  low  degree  (42  or 
less),  and  merge  them  into  a  single  supernode.  For  each  node  in  I  n  V6  we  either  merge  three 
of  its  non-adjacent  neighbors  into  a  single  supernode,  or  merge  two  non-adjacent  pairs  of  its 
neighbors  into  two  supernodes.  The  embedding  information  is  used  as  in  [CNSS1,  MST80] 
to  find  the  neighbors  that  can  be  merged  while  preserving  planarity  after  all  nodes  in  /  are 
removed.  Then  we  remove  all  the  nodes  in  I  to  get  the  graph  G' . 

After  recursively  5-coloring  the  graph  G',  we  obtain  the  coloring  of  G  as  follows.  First  we 
color  all  the  nodes  of  G  that  correspond  to  nodes  or  supernodes  of  G'  with  the  same  color 
they  were  colored  in  G'.  Now  we  add  all  the  nodes  in  /  and  in  parallel  color  every  one  of 
them  with  a  color  different  from  the  colors  of  its  neighbors. 

In  order  to  bound  the  running  time  of  the  5-coloring  algorithm  we  need  the  following 
lemma,  which  is  similar  to  Lemma  3  in  [CNS81]. 

Lemma  6  The  size  of  S  =  V4  U  V$  U  14  is  at  least  a  constant  fraction  of  the  total  number  of 
nodes  in  the  graph. 

Proof:  Let  R  =  V  —  S.  Denote  by  s,  and  r;  the  number  of  nodes  of  degree  i  in  the  sets  S 
and  R,  respectively.  Let  r«  =  Si=rr«,  and  let  tq  =  £^43  r,-.  By  Euler’s  formula,  tq  <  ~n. 

We  prove  the  lemma  by  a  counting  argument.  Definitions  of  r5  and  r6  imply  that  2rs+r6  < 
X)i^43  iri.  Euler’s  formula  implies  that  3n  >  m,  therefore 
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>  53  isi  +  5r5  +  6r6  +  ^  iti  +  ^  ir, 


1=7  1=43 


>  isi  +  7 r5  +  7r6  +  ^  ir, 


i  ’.a  ■''(,8  -  .»> 


>  7?-5  +  7r6  4-  7  r, 

i=7 

6  6 

>  7(IZ  s«  +  r5  +  r6  +  r.  4-  rg)  -  7  ]T  5,-  -  lrQ 

i=i  i=i 

g 

>  In  —  7|S|  —  7  •  — n 
Thus  |5|  >  3^.  I 

Theorem  7  Given  an  embedded  planar  graph,  the  algorithm  5-Color-Planar-Graph  5-colors 
it  using  n  processors  in  0(lg*nlgn)  time  on  a  CRCW  PRAM,  and  0((lg*n  +  lg  A)  lgn)  time 
on  an  EREW  PRAM. 

Proof :  Correctness  of  the  algorithm  follows  from  [CNS81]  and  from  the  fact  that  the  nodes 
in  I  are  independent  in  G‘  U  G[V  —  Q] 

Lemma  6  implies  that  the  size  of  S  is  f l(n).  The  graph  Gs  has  a  constant  maximum 
degree  and  hence  the  size  of  the  set  I  is  f l(n)  as  well.  Therefore  the  depth  of  recursion  is  at 
most  O(lgn). 

On  a  CRCW  PRAM,  we  can  find  S  and  Q  in  constant  time  as  in  the  algorithm  7-Color- 
Planar-Graph.  The  construction  of  Gs  U  G  takes  constant  time  because  Gs  has  constant 
degree.  The  algorithm  Constant-Degree-MIS  finds  I  in  0(lg*n)  time.  In  constant  time 
nodes  in  I  can  merge  appropriate  neighbors  and  delete  themselves  from  G  to  form  G' .  Edge 
lists  in  G'  need  not  be  compacted  when  we  are  using  the  CRCW  PRAM.  After  recursively 
coloring  G' ,  we  can  color  G  in  constant  time. 

On  the  EREW  PRAM,  O(lgA)  additional  time  per  recursion  level  is  needed  since  we 
must  compact  edge  lists  of  G'  (so  that  the  set  S  in  G'  can  be  found  in  constant  time).  | 

Remark:  Chrobak,  Diks,  and  Hagerup  [CDII87]  have  recently  improved  the  result  of  Theo¬ 
rem  7  by  giving  an  algorithm  for  5-coloring  planar  graphs  that  runs  in  0(lg‘nlgn)  time  on 
an  EREW  PRAM  and  does  not  need  an  embedding. 


Using  the  techniques  described  in  this  section,  it  is  easy  to  construct  a  fast  algorithm  for 
finding  a  maximal  matching  (MM)  in  a  planar  (or  a  constant-degree)  graph.  As  in  the  7- 
coloring  algorithm,  the  first  stage  of  the  MM  algorithm  separates  the  nodes  of  the  graph  into 
sets  V{,  such  that  the  degree  of  any  node  v  £  Vi  in  G[Vt  +  K+1  4-  V<+2>  •  •  •]  is  at  most  6.  Then 
the  graphs  {G[Vi]}  are  colored  in  parallel.  The  second  stage  of  the  algorithm  recursively 
finds  MM  in  the  graph  G\V  —  Vj]  and  removes  the  matched  nodes  to  get  G\V'\,  where  V' 
is  the  set  of  the  unmatched  nodes.  The  graph  G\V']  has  no  edges  and  the  nodes  \\  in  the 
graph  G\V'  +  Vi]  have  maximum  degree  of  6.  Hence,  in  7  iterations  over  the  colors  of  C7[Vi] 
we  can  find  the  MM  of  G. 

Theorem  8  A  maximal  matching  in  a  planar  graph  can  be  found  in  O(lgn)  time  on  c 
CRCW  PRAM  using  a  linear  number  of  processors. 

Remark:  Using  Euler’s  formula,  we  can  extend  our  algorithms  for  7-coloring  and  MIS  in 
planar  graphs  to  graphs  of  boundedgenus  7.  We  apply  the  algorithm  7-Color-Planar-Graph 
as  before  when  there  are  at  least  07  nodes  remaining  in  the  residual  graph,  for  some  constant 
c.  The  Heawood  map-coloring  theorem  states  that  any  graph  can  be  colored  with  0(^/ 7) 
colors,  and  its  proof  implies  a  polynomial  time  algorithm  for  finding  such  a  coloring  [Har72]. 
Therefore,  when  less  than  C7  nodes  remain  in  the  residual  graph,  we  sequentially  color  it 
with  7)  colors.  With  additional  time  that  is  polynomial  in  7,  we  can  then  color 

the  graph  using  the  same  time  and  number  of  processors  as  for  7-coloring  a  planar  graph. 
Note,  that  the  above  algorithm  does  not  need  embedding.  The  related  result  for  MIS  on 
bounded-genus  graphs  follows  as  before. 

6  Lower  Bounds 

In  this  section  we  prove  two  lower  bounds  for  a  CRCW  PRAM  with  a  polynomial  number 
of  processors: 

•  Finding  a  maximal  independent  set  in  a  general  graph  takes  fl(lg n/ lglg  n)  time. 

•  2-coloring  a  directed  list  takes  ft(lgn/ lglgn)  time. 
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The  first  lower  bound  complements  the  0(lgn)  C RCW  PRAM  upper  bound  for  the  MIS 
problem  that  is  achieved  by  Luby’s  algorithm  [Lub86].  The  second  lower  bound  complements 
Theorem  2  in  this  paper. 

Theorem  9  The  running  time  of  any  MIS  algorithm  on  a  CRCW  PRAM  with  a  polynomial 
number  of  processors  is  fi(lgn/  lglgn). 

Proof:  Given  an  instance  of  MAJORITY,  we  construct  an  instance  of  MIS  in  constant 
CRCW  PRAM  time.  MAJORITY  is  harder  that  PARITY  [FSSS1],  which  was  proven  to 
take  f2(lgn/ lglgn)  time  on  a  CRCW  PRAM  in  [Bea86,  BH87].  Therefore  the  lower  bound 
claimed  in  the  theorem  follows. 

Let  xj,  X2,  •  • . ,  xn  be  an  instance  of  MAJORITY.  We  construct  a  complete  bipartite 
graph  G  =  (V,E)  with  nodes  corresponding  to  ‘O’  bits  of  the  input  on  one  side  and  nodes 
corresponding  to  T’  bits  on  the  other  side. 

E  =  {(*.  j)  I  x,  # 

To  construct  this  graph,  assign  a  processor  for  each  pair  1  <  i  <  j  <  n.  Then,  each 
processor  Pa  writes  1  into  location  MtJ  if  x,  ^  xj  and  writes  0  otherwise. 

A  maximal  matching  in  a  complete  bipartite  graph  is  also  a  maximum  one.  By  construct¬ 
ing  a  maximal  independent  set  in  the  line-graph  G'  of  G,  one  can  find  a  maximal  matching 
in  G.  To  construct  the  graph  G'  assign  a  processor  P{jh  for  each  distinct  i,j,k  <  n.  Each 
P,jk  writes  1  into  location  if  MtJ  =  Mjk  =  1  and  0  otherwise. 

The  MAJORITY  equals  to  1  if  and  only  if  there  is  an  unmatched  node  i  £  G  such  that 
x,  =  1,  which  can  be  checked  on  a  CRCW  PRAM  in  constant  time.  | 

Theorem  10  The  time  to  2-color  a  directed  list  on  a  CRCW  PRAM  with  a  polynomial 
number  of  processors  is  fl(lg  n/  lglg  n). 

Proof :  We  show  a  constant  time  reduction  from  PARITY  to  the  2-coloring  of  a  directed  list. 
First,  we  show  how  to  construct,  in  constant  time,  a  directed  list  with  elements  corresponding 
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to  all  the  input  bits  x,-  with  value  of  1.  Let  x1(x2, . . .  ,  xn  be  an  instance  of  PARITY.  We 
can  assume  w.l.o.g.  that  xj  =  1.  With  each  input  cell  Mi  (that  initially  holds  the  value  of 
X,),  associate  a  processor  P,,  a  set  of  processors  Pfk  with  each  index  i,  1  <  k  <  j  <  i,  a  set 
of  cells  Mf,  0  <  j  <  i,  and  a  cell  Mf.  Initialize  all  cells  that  do  not  store  input  bits  to  0. 

In  one  step,  each  processor  Pfk  reads  the  value  of  M,-k  and,  if  it  equals  to  1,  writes  1 
into  Ml,  effectively  computing  the  OR-function  on  the  input  values  xt_j,xt_J+1, . . . ,  x,-_ 
Assign  a  processor  Pf  to  each  Mf ,  1  <  j  <  i.  Each  processor  Pf  reads  Mf  and  Mf+l  and 
writes  (t  —  j)  into  Mf  if  and  only  if  Mf  ^  Mf~l .  It  can  be  seen  that  for  all  1  <  i  <  n,  Mf 
holds  max{;  |  j  <  i,Xj  =  1}. 

We  have  constructed  a  directed  list  with  elements  corresponding  to  all  the  input  bits  x- 
with  value  of  1.  Assume  this  list  is  2-colored.  Then  PARITY  equals  to  1  if  and  only  if  both 
ends  of  the  list  are  colored  with  the  same  color,  which  can  be  checked  in  constant  time.  | 


7  Conclusion  and  Open  Problems 

We  have  presented  a  fast  technique  for  breaking  symmetry  in  parallel  and  have  shown  how 
to  apply  this  technique  to  improve  the  running  times  and  processor  bounds  of  a  number  of 
important  parallel  algorithms.  We  believe  that  the  efficiency  of  this  technique,  combined 
with  the  simplicity  of  its  implementation,  makes  it  an  important  tool  in  designing  parallel 
algorithms. 

Our  results  motivate  the  following  open  questions. 

•  We  have  proved  a  lower  bound  for  MIS  in  general  graphs.  What  is  the  lower  bound 
for  MIS  in  planar  graphs  ? 

•  Beame  has  proposed  the  following  algorithm  for  coloring  rooted  trees  of  constant  degree 
on  PRAM.  Run  the  algorithm  3-Color-Rooted-Tree  for  0(lglg*?r)  steps.  Next,  each 
processor  collects  the  colors  of  all  the  descendants  on  distance  0(lg*n)  or  less  and  uses 
this  information  and  a  precomputed  lookup  table  (of  size  0(lg*/?.  lg  lg*n))  to  compute 
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its  final  color.  Given  an  Q(lg*Ji)  preprocessing  time,  we  can  precompute  the  lookup 
table;  after  this  preprocessing  step,  the  time  to  3-color  a  tree  (or  a  pseudoforest) 
will  be  0(lglg*n).  Is  it  possible  to  3-color  a  tree  in  o(lg*n)  time  on  PRAM  with  no 
preprocessing? 

•  Can  we  compute  an  MIS  in  general  graphs  in  o(lgn)  time  ? 

•  Recently,  several  papers  [CV86,  Rei85]  that  present  parallel  algorithms  with  “optimal 
speedup”  have  been  published.  (The  measure  of  optimality  used  in  these  papers  is  how 
close  is  the  processor-time  product  of  the  parallel  algorithm  to  the  running  time  of  the 
fastest  known  sequential  one.)  The  problems  we  have  been  studying  in  this  paper  can 
be  solved  in  linear  sequential  time,  but  the  processor-time  products  achieved  by  our 
algorithms  are  superlinear  (by  a  lg'n  or  a  polylogarithmic  factor).  How  can  one  reduce 
the  processor  requirements  of  the  algorithms  without  increasing  their  running  time 
in  order  to  achieve  linear  time-processor  products?  Also,  will  this  improved  processor 
efficiency  induce  significantly  more  constraints  on  the  model  as  compared  to  our  current 
algorithms  ? 
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